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ABSTRACT 

It was recently concluded that experiments "indicate 
that relatively brief interactions with a commercial game can cause a 
significant improvement in children's performance on an IQ subtest" 
(Dirks, 1982). The present paper makes three points. First, Dirks' 
analyses were not appropriate to the research question, and hence 
provide no rigorous evidence justifying the conclusion. Second, even 
if the analyses had been appropriate, the most the experiment could 
possibly have demonstrated was that a game nearly identical to the 
Wechsler Intelligence Scale for Children — Revised (WISC-R) Block 
Design subtest could elevate subjects' scores on the subtest. Third, 
by employing bwo games of less but still substantial similarity to 
block design tasks, the present experiment found no evidence of 
positive transfer to the Block Design Subtest of the Wechsler 
Preschool Primary Scale of Intelligence (WPPSI). Taken together, it 
would appear that, for the present, there is no rigorous evidence 
that games of any degree of similarity other than identity 
(test-retest effect) affect scores on the WISC-R or WPPSI subtests. 
(Author/RH) 
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ABSTRACT 



It was recently concluded that experiments "indicate 
that relatively brief interactions with a commercial game can 
cause a significant improvement in children's performance on 
an IQ subtest" (Dirks, 1932, p. 109). The present paper 
makes three points. First, Dirks'' analyses were not 
appropriate to the question, and hence provide no rigorous 
evidence justifying the conclusion. Second, e'jen if the 
analyses had been appropriate, the most the experiment could 
possibly have demonstrated was that a g^i^me nearly identical 
to the UHSC-R Block Design subtest could elevate its score, 
ih.^d, by employing two games of less but still substantial 
similarity, the present experiment finds no evidence of 
positive transfer on the Block Design subtest of the WPPSI . 
Taken together, it would appear that, for the present, there 
is no rigorous evidence that games of any degree of 
similarity other than identity (test-retest effect) affect 
scores on the UIISC-R or UIPPSI subtests. 
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The present experiment is concerned with the e-f-fect o-f 
play behavior on related IQ-test item per -f ormance • The 
impetus -for the study comes -from two contrary themes. On the 
one hand, experimental intervention studies during the 
preschool years have not been notably successful in changing 

not 

IQ-test scoresy^even the more prestigious large-sample 
longitudinal studies (e.g., Lazar and Darlington, 1932). On 
.the other hand, there are reports that home play experiences 
do affect these scores (e.g., Stevenson, Parker, Wilkinson, 
Bonnevaux, and Gonzalez, 1?73). Among the latter studies is 
one that reports what at first glance looks 1 ike a startl i ng 
result, that 30 minutes of exposure to a 
commercially-produced game similar to the Block Design 
subtest of the UIISC-R results in significant improvement on 
that subtest (Dirks, 1?S2). Closer reading, however, reveals 
that the game, "Trac 4", is almost identical to the Block 
Design subtest. That is, red and white cubes, with some 
sides divided along the diagonal , are to be sorted to match a 
geometric pattern depicted in a pictured square model. Thus 
the same material (blocks) with the same colors (red and 
white) with the same patterns (horizontal and diagona.l color 
divisions) are used to seek the same goal (pattern matching). 
Furthermore, response time is measured in both tasks. "So it 
is possible to conclude simply that 30 minutes of practice on 
a Block Design-like game improves performance on the Block 
Des i gn subtest . 

This is not a particularly surprising finding. An 
interesting extension of the study would be to vary the 



similarity of the game and the test item, and thereby to get 
some idea o-f the -functional relationship between game-test 
similarity and test-score gain. The Block Design subtest is 
a par t i cu 1 ar 1 y good choice -for such a study, because there is 
considerable evidence that test items which require discovery 
o-f -figural relationships are more highly g-loaded than any 
other type o-f item (e.g., Jensen, 1930, p. 229). In the 
present study, then, block pi ay was provided to two 
experimental groups, and pre-post test scores were compared 
to those o-f a control group. The two experimental groups 
varied in the similarity o-f the block material to the test 
ma ter i al . 



METHOD 



Subjects and IQ test. Subjects were 37 children with 
ages between 3 years, 10 months and 6 years, 2 months, with a 
mean o-f 59.4 months. All subjects attended a day-care center 
operated by the University o-f Southern California. Ethnic 
background was diver se, and the -families in general were 
middle class. The 19 girls and 13 boys comprising the sample 
were selected with two criteria in mind: <1) they expressed a 
desire to "play games" with the experimenter, and <2) they 
met the age restrictions o-f the Ulechsl er Preschool and 
Primary Scale o-f Intelligence (UIPPSI). 

With preschool children, the appropriate Wechsler IQ 
test is the Wechsler Preschool and Primary Scale o-f 



Intelligence (UIPPSI). The Block Design subtest o-f the UIPPSI 
is similar to that o-f the UlISC, with a -few minor di -f -f erences , 
the largest o-f which is the use o-f 2-d i men s i onal blocks 
instead o-f 3-d i men s i onal blocks. Other ingredients ace 
approximately the same: red and white colors divided on the 
horizontal and diagonal, instructions to copy a design, speed 
i nstruc t i ons, e tc . 

Pes i gn , The subj ec t s wer e random ] y assigned to three 
groups, two experimental groups <N = 11 and 12), and a 
control group <N = 14). One experimental group was exposed 
to red and white blocks with at least one side o-f each block 
diagonally divided into red and white sections. These 
patterns are the same as the patterns on the block design 
subtest of the WPPSI . The other experimental group was 
exposed to similar blocks, but they were blue and yellow, and 
were divided horizontally instead o-f diagonally. The control 
group was not exposed to blocks at all, but rather" to typical 
preschool material such as color books, etc. The purpose o-f 
this group was to control -for other -factors (-familiarity with 
the experimenters, practice at -following directions, etc.) 
which might possibly produce positive trans-fer. 

Mater i al s . The critical training stimuli were the 
blocks, which were made o-f wood (small blocks) or cardboard 
(large blocks). They were non-uni-form in shape and size, 
their shapes being either square or rectangular, and their 
dimensions ranging -from about 1 in per side to 12 in per 
side. Blocks -for the red-white experimental group were 
covered with red and whit'/ contact paper with at least one 



sur-face per block divided diagonally into red and white 
sectors, and other sur-faces being either red or white. 
Blocks -for the blue-yellow experimental group were covered 
with blue and yellow contact paper with at least one side 
divided horizontally into blue and yellow sectors. There 
were 32 blocks -for each condition. Materials -for the control 
group were two commercial "workbooks" designed to prepare 
children -for Kindergarten, These workbooks contained tracing 
shapes, geometric -figures, coloring -figures, and the like. 
So all groups were exposed to -figural content, but only the 
two exerimental groups manipulated blocks to match a 
standard , 

Pr oce dur e , The two experimental groups were tre^ited 
identically during play periods, except -for the colors of 
their blocks. Subjects in each group were run in pairs in 
order to take advantage o-f peer-elicited activity and to 
increase the -familiarity o-f the situation. Each subject 
participated in a one-hal-f hour session, except that six oS 
the 12 subjects -from the red-white condition were given a 
second one-hal-f hour play session on another day in order to 
explore training-duration e-f-fects. The Dirks study employed 
one one-hal-f hour duration, split into 2 15-minute periods. 

While no explicit instructions were given to the 
subjects, an experimenter was on hand to keep interest 
-focused on the blocks and to urge constructive play; e, g, , 
"Build me a castle" or "See how high a tower you can make". 
The experimenter occasionally participated in order to 
encourage more block play. The play area -for the pairs o-f 
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subjects was 8 x 10 -ft, and no other toys were available 
during the play periods. The accompanying photograph shows 
one of the larger block constructions and its would-be 
ar ch i tec t s . 

Insert photograph about here 

All testing and play sessions took place at the 
subjects^ day-care center in rooms -familiar to the subjects. 
There were three experimenters, all graduate students in 
child development. They had been extensively trained in 
administration of the UIPPSI , and were familiar to the 
subjects as a result of free-play participation during the 
school year • 

All subjects were pre- and postested on the five 
performance subtescs of the UIWPSI : <!> Animal House, a 
measure of learning ability to asscciate sign with symbol; 
<2) Geometric Design, a mec^sure of ability to draw a series 
of pictures <i. e., circle, square, diamond); (3) Picture 
Completion, a measure of ability to identify missing parts of 
pictures of common objects; <4) Mazes, a measure of ability 
to solve pencil maze problems by tracing out an appropriate 
path; and <5) Block Design. 

Pretests were administered approximately one month prior 
to exposure to the play sessions, and postests were 
administered 7 to 10 days later. The logistics of the 
situation made it impossible to schedule blind testing. 
However, experimenters did not necessarily postest the same 
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subject^ they had pretested^ but rather about 1/3 o-f those 
subjects. Experimenters did not see the pretest protocals or 
scores after the initial scoring some 6 weeks earlier. They 
reported little recall, which they attributed to, time and 
interference -from intervening test administrations. 

RESULTS 

Table 1 presents means and standard deviations -for 
scaled scores on all -five performance subtests, separately 
for pretest and postest, and separately for treatment 
conditions. The two nested conditions within the 
experimental condition (one-half hour vs. one hour exposure) 
are col lap d, as there was not the slightest suggestion of a 
difference in outcome. 

Insert Table 1 about here 

A glance at the grand means at the right column of Table 
1 indicates a gain from pretest to postest for all five 
subtests. The cell contents show that this gain held for all 
experimental conditions, with no exceptions. A second effect 
indicated by the grand means is that the gains for Picture 
Completion and Blocks Design (about 2 units) are more than 
twice as large as the gains for any of the other three tests, 
for which the average gain is about 0.8 units. Third, 
scrutiny of the table contents reveals no other systematic 
patterns. In particular, it is not the case that the 
red-white condition produced the greatest gain for Block 



Desi gn . 

These data were submitted to an unweighted-means 
analysis of variance <ANOUA) with experimental condition as a 
group -factor '3 levels), and pre-postest <2) and subtests <5) 
as two repeated-measures factors. The results of this 
analysis are totally redundant with the observations just 
made: there were two significant effects, pre-postest gain, 
_F<1,34) = 43.3, _p < .001, and subtests by pre-postest 
interaction, _F<4, 136) = 9.7,_p_< .02. Of particular 
importance is the nonsignificant triple interaction, which 
reflects no treatment differences in relative gain among the 
five subtests, _F<3, 136) < 1.0. 

These results can perhaps be more fully comprehended by 
looking at Figure 1, which presents the mean pre-postest 
difference for each treatment and for each subtest. It is 
clear from this figure that the red-white treatment did not 
produce a greater gain for Block Designs than for any other 
subtest, since there is a greater gain for Picture 
Completion. It is also clear that the red-white treatment 
did not produce the greatest gain for Block Designs, since 
the blue-yellow condition produced a greater gain. It is 
unlikely this result is due to chance group differences in 
ability to profit from the pretest administration, because 
the blue-yellow sub j ec t s .ga i ned less than the red-white 
subjects on two of the remaining four subtests. 

Insert Figure 1 about here 
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Block Design scores by themselves vjere also submitted to 
ANOl^A, with a single group -factor < treiitmcrn ts) and a single 
repeated-measurement -factor <pre-postest ) . I -f there were 
signi-ficant di -f -f erences in treatment gain scorps on the 
Blocks Design subtest, a signi-ficant interaction between the 
two -factors would be expected. However, F<2/34) = 1.4; p > 
• 20. So there is no indication that the three treatment 
conditions produced dif-ferent gain scores fcr- Bl ock Des-i gn . 

DISCUSSION 

The present results suggest caution in assigning a 
causative role to game experience in boosting IQ-scores when 
the game-subtest similarity is less than near per-fect. The 
present study -found no evidence o-f trans-fer on the block 
design subtest when it was compared to transfer on other 
subtests unrelated to the experimental game. 

Upon closer examination, it turns out that the Dirks 
study was also unable to produce relative positive trans-fer. 
Positive trans-fer was analyzed, but not positive trans-fer 
relative to that o-f other subtests.. This omission is 
puzzling because the exper imen t^U design provided information 
■for such an analysis. That is, pre-post test changes on the 
block design subtest as a -function o-f game or no game 
experience should have been compared to the same changes on 
the Object Assembly subtest, the other subtest administered 
by Dirks. Such an ANO^A would have evaluated the three 
factors of game vs. no game, pretest vs. post test, and Block 
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Design vs. Object Assembly subtests. But two 2x2 ANOVAS 
are reported instead, one -for each ta«.l<. 

The seriousness o-f this omission can be appreciated by a 
re-examination o-f Dirks^ Figure 2, which presents the 
pre-test and post-test mean scores -for the Block Design 
subtest alone (see our Figure 2, solid lines). A signi-^icant 
2-way interaction is reported -for these data, indicating that 
the post-test gain is greater for the game condition than -for 
the no game condition. We have added to that -figure the mean 
scores -for the Object AsseTtbly task (dashed lines). These 
means reveal the same 2-way interaction, although o-f lesser 
magnitude. What is not reported is the s i gn i -f i cance o-f the 
dif-ference between these two interactions. There-fore we have 
no way o-f determining i -f the game activity imprcv^^d Block 
Design scores more than Object Av>sembly scores or not. Such 
a conclusion would be war-ranted only i -f t!.? triple 
interaction o-f all three -factors had been -found to be 
si gn i -f i can t . It was the absence o-f such an interaction in 
the present study that led us to accept the null hypothesis. 



INSERT FIG 2 ABOUT HERE 

We suggest, then, that there is little rigorous ev'idence 
that even a nearly identical game produces trans-fer speci-fic 
to the Block Design subtest. Our own results replicate this 
non— finding, and extend it to games o-f lesser similarity. It 
may be the case, o-f course, that other games related to other 
subtests might produce positive trans-fer. It would seem, 
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however, that at least -for the WISC-R and the UPPSI , such a 
possibility remains to be demonstrated. 
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Table 1 

Pre-postest scores on Five UIPPSI Performance Scales for 
three Treatment Conditions 



Test 



Red-Wh i te 
Mean SD 



Bl ue-Ye 1 1 ow 
Mean SD 



Con trol 
Mean SD 



Grand 

Mean 



AH Pre 12.4 2.3 
Post 13.5 3.5 



12.5 1.8 
13.3 1.5 



12.4 2.6 
13.7 3.1 



12.4 
13.5 



PC Pre 11.8 2.3 
Post 14. o 2.2 



13.2 2.4 
14.6 2.4 



11.3 2.9 
13.6 3.2 



12.1 
14.3 



BD Pre 12.9 2.4 
Post 14.'? 1 ."S 



12.1 2.8 
14.7 2.9 



13.5 1.7 

14.6 2.3 



12.8 
14.7 



M Pre 12.8 1.9 
Post 13.3 1.7 



12.7 
14.1 



1 .7 
2.5 



12.6 1.7 
12.9 1.4 



12.7 
13.4 



GD Pre 13.0 3.0 
Post 13.1 2.8 



12.6 
13.5 



2.2 
2.7 



12.7 2.8 
13.4 2.8 



12.8 
13.3 



1. AHiAnimal House; PC-Picture Completion; BD:Block Design; 



M :Mazes* 



GDrGeometric Design 
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Fi gure cap t i ons 



Figure 1. Mean pre-postest difference scores for two 

experimental groups and control group. AH: Animal 
House; PC: Picture Completion; BD: Block Design; 
M: Mazes; GD : Geometric Design. 

Figure 2. Dirks' Figure 2, with addition of Object 
Assembly scores. 
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Block play in the red-white experimental condition 
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